An improved extrinsic monolingual plagiarism detection approach of the Bengali text
Abstract
Plagiarism is an act of literary fraud: presenting others' work or ideas without giving credit to the original source. All published and unpublished written documents fall under this definition. Plagiarism, which has increased significantly over the last few years, is a concerning issue for students, academicians, and professionals. Because of this, several plagiarism detection tools and software packages are available to detect plagiarism in different languages. Unfortunately, negligible work has been done on the Bengali language, even though it is one of the most spoken languages in the world. In this paper, we propose a plagiarism detection tool for Bengali that mainly focuses on the educational and newspaper domains. We collected 82 textbooks from the National Curriculum and Textbook Board (NCTB), Bangladesh, scraped all articles from 12 reputed newspapers, and compiled our corpus with more than 10 million sentences. The proposed method shows an accuracy rate of 97.31% on Bengali text.
Similar resources
Monolingual and Crosslingual Plagiarism Detection
Automatic plagiarism detection against a reference corpus compares a suspicious text to a set of documents in order to relate the plagiarised fragments to their potential source. The suspicious and source documents can be written either in the same language (monolingual) or in different languages (crosslingual). In the context of the Ph.D., our work has been focused on both monolingual and...
A Pairwise Document Analysis Approach for Monolingual Plagiarism Detection
The task of plagiarism detection entails two main steps: suspicious candidate retrieval and pairwise document similarity analysis, also called detailed analysis. In this paper we focus on the second subtask. We report our monolingual plagiarism detection system, which is used to process the Persian plagiarism corpus for the task of pairwise document similarity. To retrieve plagiarised passag...
An Effective Approach for Compression of Bengali Text
In this paper, we propose an effective and efficient approach for compressing Bengali text. The paper presents a methodical study of Bengali text compression techniques. The main target of this research is to provide a framework for Bengali text compression that ensures a simple, effective, and computationally inexpensive scheme. The proposed Bengali text compres...
A Novel Approach for Plagiarism Detection in English Text
Digitalization makes text related to many academic areas easily available on the web, which has become a serious problem for academic enterprises and institutes. This paper presents a plagiarism detection system for the English language. The digital world makes text related to many academic areas easily available on the web, which has become a serious problem for academic enterprises or instit...
Journal
Journal title: International Journal of Power Electronics and Drive Systems
Year: 2023
ISSN: 2722-2578, 2722-256X
DOI: https://doi.org/10.11591/ijece.v13i4.pp4256-4267